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Search means containing fixed-length addresses generated by a hash 
function 

Field of the invention 

5 The present invention relates to database searches and search 

means, especially used in an access point system composed of one or more 
servers, which are connected to each other by a intranet, and which handle 
data sent by client applications. 

1 0 Background of the invention 

Various client-server services are based on the use of access 
points, wherein clients are logged into a service and communicate with each 
other via a set of access points. Each client is an application program and a 
set of access points composes a logical server. Instant messaging services, 
15 such as a chat service and an e-mail service, are some examples of client- 
server services. A set of access points is connected to an access network 
that may be a fixed network or a wireless network. The access network is 
preferably a packet-switched network. 

The Internet is a packet-switched network whose nodes have an 
20 Internet protocol address (IP address). Each IP address consists of four 
numbers between 1 and 255, and dots separating each number; for example, 
193.199.35.5. The first number refers to the topmost network level and the 
second number refers to the next level, etc. The routers of the Internet locate 
the correct receiver by its IP address. Since the Internet is a packet-switched 
25 network, no circuit is allocated for the connection. Instead, data is transmitted 
in packets from the sender to the receiver. 

Each IP packet includes a header with the following information: a 
packet length, time to live (hop counter), protocol, a sender IP address, a 
receiver IP address, a sender port (application), among other things. The IP 
30 packet may carry the packets of a higher level protocol as a payload. Typical 
higher level protocols are the transmission control protocol (TCP) and the 
user datagram protocol (UDP). 

When using TCP/IP the bytes (octets) have a sequence number. 
Thus, a receiver node detects if one or more bytes are missing. Then it sends 
35 a retransmission request for missing bytes. In another protocol a receiver 
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may send an acknowledgement as response to receiving bytes. UDP is in 
some cases an alternative to TCP/IP. UDP implements means to send data- 
grams without any control protocol. Therefore retransmission requests or 
packet acknowledgements are not used in UDP. For the same reason a 
5 sender cannot know whether a receiver has received the packets sent. 

The world wide web (WWW or web) is an Internet-based, distrib- 
uted hypermedia information system. The web pages are traditionally repre- 
sented using hypertext markup language (HTML). HTML and its successor, 
extensible markup language (XML), are intended for forming structured 
10 documents to be interchanged in the web. Structured documents are 
searched for and read through software that is termed a browser. Hypertext 
transfer protocol (HTTP) determines how structured documents are trans- 
ferred in the Internet. 

As the Internet has become very popular, it has also been brought 
15 to mobile and wireless devices. Many of the prior art services presently in 
use are based on the global system for mobile communications (GSM) stan- 
dard. General packet radio services (GPRS) and the universal mobile tele- 
communications system (UMTS) are third-generation mobile communication 
systems which will replace second-generation mobile communication sys- 
20 terns, such as GSM. 

Wireless markup language (WML) is a formal language that allows 
the text portions of structured documents to be presented via a wireless 
network on wireless devices. WML is a part of wireless application protocol 
(WAP). WAP is similar to TCP/IP based protocols enabling Internet in wire- 
25 less devices. 

In addition to TCP/IP and UDP packets, short messages are a 
method to transmit data from a client to an access point. GSM limits the 
length of short messages to 160 characters. Multimedia messaging service 
(MMS) is able to deliver larger messages in a reasonable time compared to 
30 SMS. The fixed limit will be replaced by an ability to not only transfer much 
larger text contents, and graphics, but also audio or video clips. 

Thus, the access network may be a fixed network or a wireless 
network, such as a GSM, GPRS, or UMTS. Or the access network may be a 
wireless local area network (WLAN). Clients send data via the access net- 
35 work to a set of access points. Data may be located, for example, in a 
TCP/IP packet, an UDP packet, or in a short message. 
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When a client is logged into a service and sends data to an access 
point, the data contains at least one reference address. The said reference 
address identifies a sender, a receiver, or a service. The reference address 
may be a fully qualified domain name (FQDN), or it may be an MSISDN 
5 number, i.e. a mobile subscriber integrated service digital network number. 

Thus, data sent by a client may include one ore more reference 
addresses which are, for example, MSISDN numbers or fully qualified do- 
main names. An e-mail address, such as Alfa@wiral.com, is one example of 
an FQDN, but there are also other types of domain names. 
10 The Internet consists of thousands of domains. Each domain has 

a domain name which is mapped to a certain IP-address. Several domain 
names may be mapped to the same IP-address. For example, domain 
names www.jypoly.fi and jkolamk.jkol.jypoly.fi are mapped to IP-address 
193.199.35.1. Conversely, fully qualified domain names are unique, such as 
15 e-mail user names. 

Domain names compose a hierarchical domain name system 
(DNS). The root of a DNS tree is nameless. Top-level domains are under the 
root: the original three-letter domains are .com, .net, .org, .edu, .int, .mil and 
.gov, plus two-letter top-level domains for each country. Under the top-level 
20 domains there are lower domains connected to the Internet. The Internet 
includes domain name servers mapping domain names to IP addresses. 

Uniform resource locator (URL) is a system uniquely identifying 
each resource in the Internet, i.e. where each document or file is located. 

A URL address consists of a domain name and a search path. For 
25 example, URL address "www.jypoly.fi/internet/jamk.nsf consists of domain 
name "www.jypoly.fi" and search path "/internet/jamk.nsf". 

A URL request consists of a protocol part and a URL address. For 
example, in the following URL request the protocol part is "http://" and the 
URL address is the before-mentioned "www.jypoly.fi/internet/jamk.nsf: 

30 http://www.jypoly.fi/internet/jamk.nsf 

A uniform resource identifier (URI) is an access path to a certain 
piece of information. A URI always contains a URL. For example, a URI may 
be an access path to a WWW page: 

http://www.jypoly.fi/internet/jamk.nsf/www/779EBAC7D6A67A14C 
35 22567E7002B127A?OpenDocument 
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FIG. 1 shows a client-server system composed of one access 
point, wherein client Alfa 11 and client Beta 12 are logged into a service. 
Client Alfa and Beta are applications having their own reference addresses, 
such as e-mail addresses. During the use of the service client Alfa sends 
5 data via an access network 13 to client Beta's reference address. An access 
point 14 receives the packets and transmits them to client Beta. In FIG. 1 
client Alfa is located in a mobile phone and client Beta is located in a laptop. 
In addition to these devices, a client could be located in, for example, a per- 
sonal digital assistant (PDA), a personal computer, or a network server. 

10 For example, Jabber IM server described in http://www.jabber.org 

can be used as an access point for instant messaging (IM) services. How- 
ever, when a system should have high capacity, a set of access points is 
needed to handle data sent by clients. The access points can be cou- 
pled/connected to each other by means of an intranet. 

15 The invention relates to a system composed of at least one server 

which could be e.g. a database server, or an access point. A system may 
include several access points when access points are coupled, directly or 
indirectly to each other, for example, by using an intranet of 100 Mbps 
Ethernet Also a relative high capacity network may be blocked because of 

20 high load. Blocking of a network is one drawback of the prior art. Patent ap- 
plication PCTxxxxxxxxxx contains solutions for the said drawback. 

Another drawback of the prior art concerns search means that are 
used in a server for database or file searches. The search means are the 
subject of this patent application. 

25 A database is a collection of data organized in a fashion that facili- 

ties updating, retrieving, and managing the data. The data may consist of 
anything, including, but not limited to reference addresses. Various search 
means can be used for database retrievals. Lists, tree data structures, and 
hashing methods are typical examples of search means. 

30 A key is a value intended for searching for a certain data collection 

in a database. Typically a big database cannot be entirely located in a main 
memory. Various data structures have been designed and implemented to 
index data collections of a database. If the content of a database is change- 
less, the fastest search means/method is binary search. When a database 

35 includes n keys the processing time of the search operation is Oflog n). 
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A binary tree is a data structure based on binary search. If keys 
are short and of a fixed-length, such as social security numbers, the keys are 
preferably located in the nodes of a binary tree. However, if a database is big 
and keys are long the keys must be located in disk memory, which makes 
5 database searches very slow because each comparison operation causes a 
disk access. 

FIG. 2 illustrates using a binary tree as a search means. As shown 
in FIG. 2, each node of a binary tree includes, a left and right link to other 
nodes, and a link to a key record. For example, node 201 includes a left link 

10 202 to node 203, a right link 204 to node 205, and a link 206 to a key record 
207. The key record 207 includes a key 208 and a link 209 to a data collec- 
tion 210. In FIG. 2 a search key 210 is "Mike" and the key 208 related o die 
root node 201 of the binary tree is "Lisa". First the link 206 is followed to 
obtain the key 208 and then the kevs 210 and 208 are compared. Because 

15 Mike alphabetically succeeds Lisa, the right link 204 of the node is followed 
to obtain the next node 205. The node 205 has a link to a key record 212 
including the key "Mike", i.e. in this case the search key is found from the 
binary tree. We may suppose that the binary tree and key records can be 
kept in main memory. Then a comparison operation does not cause a disk 

20 access. Still following links and comparing various-length keys essentially 
increases the processing time of the search operation. 

Hashing is another search means. It is based on the use of a hash 
function that inputs a character string related to a data collection and outputs 
a numeric value. If the character string is a unique value, the numeric value 

25 should be, too. The numeric value determines a bucket in which the said data 
collection is stored. Each bucket is composed of a fixed number of ce' r so 
that each cell includes one data collection. Buckets are usually store:: ,-. a 
disk memory. An adequate hash function spreads data collections uniformly 
into buckets. In that case a search operation requires one disk access in 

30 which a certain bucket is read in main memory. However, hashing may fail so 
that a lot of buckets are empty and simultaneously some buckets are over- 
flowed. Overflow means that all the cells of a bucket are in use. In that case 
the bucket is usually chained to one or more data collections located in an 
overflow area. This increases the number of disk accesses and the process- 

35 ing time of a search operation. 
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Especially an FQDN type of reference address, which includes an 
URL, may be very long. The handling of long reference address takes more 
time than handling of shorter reference addresses, of course. Anyway, vari- 
ous-length reference addresses slow down database and file searches. 

5 

Summary of the invention 

The objective of the invention is to upgrade the search means of a 
file or a database when keys are of various-length and possibly long. The 
keys may be reference addresses, such as fully qualified domain names. 

10 The objective is reached by combining the use of a hash function 

with an appropriate data structure. The hash function is used for generating 
fixed-length addresses, which are located in the nodes of the data structure. 
The hash function is preferably Digest and the data structure is, for example, 
a B-tree or skiplist. Another hash function and other data structures can be 

15 used, too. 

The hash function should input a various-length reference address 
and it should output a relatively short fixed-length address. In addition, the 
fixed-length address should be unique with high probability. Because the 
fixed-length addresses generated are short, they can be located in the nodes 
20 of a B-tree and the size of the B-tree is still so small that the B-tree can be 
kept in the main memory. 

The inventive search means can be utilized in various database 
systems and especially in access point systems. 

25 Brief description of the drawings 

The invention is described more closely with reference to the ac- 
companying drawings, in which 



Figure 1 shows an access point communicating with clients, 
30 Figure 2 illustrates using a binary tree as a search, means, 

Figure 3 shows a binary tree whose nodes include a fixed-length address, 
Figure 4 shows a B-tree including fixed-length addresses, 
Figure 5 depicts a hash function and its input and output, 
Figure 6 shows an example of an access point system, 
35 Figure 7A shows an example of a database system, 

Figure 7B shows a search means for uniform resources identifiers. 
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Detailed description of the invention 

Usually, a data collection is considered to a have just one unique 
piece of information which is termed a primary key. The data collection may 
include another unique piece of information that can be used as a secondary 
5 key. There may be several secondary keys so that a primary key and each 
secondary key are related to a certain search means. Search means may 
differ from each other. We have used the term "reference address" along with 
the term "key". A reference address may or may not be unique. 

The records of a search means may or may not be termed nodes 

1 0 and those nodes may or may not compose a data structure which is termed a 
tree or a list in the prior art. 

FIG. 3 shows a binary tree each of whose nodes includes a fixed- 
length address that is generated from a reference address by using a hash 
function. The binary tree is balanced as the binary tree shown in FIG. 2 and 

15 both the trees include the same number of nodes. In each node 301, 302, 
and 303 is located a fixed-length address, and these fixed-length addresses 
are marked with 304, 305, and 306. The fixed-length address 304 is gener- 
ated from the reference address "Lisa" and the fixed-length address 305 is 
generated from the reference address "Mike". When the search key is "Mike", 

20 the search key is generated by the same hash function that has been used to 
generate fixed-length addresses 304, 305, 306, and the other fixed-length 
addresses are located in the binary tree. The binary tree is used as a search 
means as follows. First the search address generated is compared to the 
fixed-length address located in the node 301. The search address is the 

25 bigger one, thus the right link of the node 301 is used to obtain the node 302. 
Then the search address is compared to a fixed-length address located in the 
node 302. Now the search address and fixed-length address match, thus a 
data collection related to the search key "Mike" is obtained from the database 
307 by using the link 308 of the node 302. 

30 Digest hash function, a skiplist, and a B-tree are all known in the 

prior art. Digest hash function, or in more specific, the MD5 message-Digest 
algorithm is described in RFC1321 published by the Internet engineering task 
force (IETF). Digest hash function may result in a non-unique fixed-length 
address, though its input, i.e. a reference address, would be unique. How- 

35 ever, the probability that Digest hash function results in a non-unique fixed- 
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length address is very small when reference addresses are unique. The said 
probability is only (1/1 6) 32 . 

If required, a generated fixed-length address can be searched in a 
data structure where active fixed-length addresses are stored in. If the said 
5 address is found in the data structure, a new fixed-length address is gener- 
ated until the address is not found in the data structure. This way it is possi- 
ble to ensure that all active fixed-length addresses are unique. 

The structure and use of a skiplist is described in Communications 
of the ACM, 33(6):668-676, June 1990. 

10 A B-tree belongs to a set of data structures termed "balanced 

trees". A red-black tree and an AVL-tree are other examples of balanced 
trees. A binary tree and balanced trees are basic data structures that are 
taught in the literature of computer science. 

FIG. 4 shows a B-tree including fixed-length addresses. Numbers 

15 from 2 to 37 represent 16 bytes long fixed-length addresses. In this example, 
the B-tree is composed of five nodes. The nodes of the B-tree do not need to 
contain the same number of fixed-length addresses, more generally, the 
nodes of the B-tree do not need to contain the same number of nodes. As in 
FIG. 3, in addition to a fixed-length address, a node of the B-tree typically 

20 includes a link to a certain data collection stored in a database. These links 
and the database are omitted from FIG. 4. 

In addition to the data structures shown in FIG. 3 and 4, there are 
a number of data structures where operation as a search means can be 
upgraded by placing fixed-length addresses in the nodes of the data struc- 

25 tures. 

FIG. 5 depicts a hash function and its input and output. The hash 
function 52 reads a relative long reference address 51 as a parameter and 
outputs a relative short fixed-length address 53. A reference address may be 
1-500 bytes long, when a fixed-length address is preferably 16 bytes long. 
30 The hash function is preferably Digest, but another appropriate hash function 
can be used. The hash function should output a fixed-length address that is 
unique with very high probability. 

An inventive data structure can be implemented as follows. A set 
of data collections is stored in a database or file, wherein each data collection 
35 contains at least one reference address. The data collections are passed 
through one by one as follows: 1) the reference address is obtained from a 
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data collection, 2) the fixed-length address is generated by applying a hash 
function to the reference address, 3) the fixed-length address is located in a 
node of the data structure, and 4) the pointing means of the node are set to 
point to the data collection containing the reference address from which the 
5 fixed-length address located in the node was generated. The pointing means 
may be e.g. as a pointer, link or index. The node may also include at least 
another pointing means. The said pointing means can be set to point to a 
certain node of the data structure depending on the category of the data 
structure and the values of the fixed-length address already contained in the 

10 data structure. The data structure category may be e.g. a B-tree or skiplist. 

The invention is not limited to any specific type of databases or 
files, i.e. it is very general-purpose. The search means is especially useful in 
instant messaging, thus instant messaging is next discussed in more detail. 

In an instant messaging service data sent by a client contains at 

15 least one reference address. The reference address could be e.g. the re- 
ceiver's e-mail address, such as beta@wiral.com. The reference address is 
inputted as a parameter to a hash function which generates a 16 bytes long 
fixed-length address. The fixed-length address replaces the reference ad- 
dress in the data part of the IP-packet that is transmitted in the intranet. The 

20 intranet is preferably an Ethernet network and the packets to be transmitted 
in the intranet are preferably IP packets. 

FIG. 6 shows a system that contains three access points, a load 
balancer, and a gateway. Clients Alfa 61 and Beta 62 can communicate with 
each other through the said system. The clients may send data via an access 

25 network 63 to a load balancer 64. The load balancer takes care that the three 
access points, 65, 66, and 67 are uniformly loaded. The three access points 
and a gateway 68 are connected to an intranet 69. When client Alfa logs into 
the system, we may suppose that the access point 65 generates a fixed- 
length address for client Alfa and locates the fixed-length address in a node. 

30 The node is added to a search means used by the access point 65. Corre- 
spondingly, when client Beta logs into the system, the access point 67 gen- 
erates a fixed-length address for client Beta and locates the fixed-length 
address in a node. The node is added to another search means used by the 
access point 67. Client Alfa sends data containing a reference address, 

35 wherein the data could be e.g. an instant message. The access point 65 
obtains the reference address from the data and generates a fixed-length 
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address. Then the access point 65 locates the fixed-length address and the 
payload in an IP-packet and sends the IP-packet to the intranet 69 using one- 
to-many transmission method, i.e. broadcast, multicast or anycast methods. 
All access points receive the IP-packet, obtain the fixed-length address from 
5 it, and searches the fixed-length address from the memory by using the 
search means. If the said address is found, i.e. the fixed-length address is 
the same as the access point 67 generated for client Beta, the data sent by 
client Alfa is transmitted to client Beta. In this use case, clients Alfa and Beta 
were logged in the same system or domain. If they would be logged in differ- 

10 ent systems, they would communicate via the gateway 68. 

When an access point or a gateway receives a packet from the 
intranet, the access point or the gateway searches the fixed-length address 
from the memory by using the search means. 

A packet to be transmitted in an intranet may include one, two, or 

15 more fixed-length addresses. If the packet includes at least two fixed-length 
addresses, one of the addresses may cause a predefined operation in a 
receiving access point or a gateway. The predefined operation may be, for 
example, the comparing of domain names as described above. 

We may suppose that a fixed-length address is usually generated 

20 from one piece of information that identifies e.g. a sender, a receiver, or a 
service. However, a certain predefined combination of pieces of information 
could be one reference address and that reference address is inputted as a 
parameter to a hash function. 

For example, a reference address could be composed of a name 

25 and a home address. Thus, a character string "Clay Saviranta, Luutnantintie 
10 E 62, 00410 Helsinki, Finland" could be a reference address related to a 
certain person. The fixed-length address generated from this reference ad- 
dress could be a pointer to e.g. a health care database, so that the data 
collection of the said person is obtainable through the pointer. A fixed-length 

30 address generated from a name and a home address may be useful if a 
unique key, such as a social security number, is not available. A person's 
name or home address is alone an unreliable key, for example, to a health 
care database, because two or more persons may have the same name and 
members of a family usually have the same home address. 
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The combination of pieces of information may result in a longer 
reference address from which it is more reliable to generate a unique fixed- 
length address than one piece of information. 

The combination of pieces of information can also be used to save 
5 the processing capacity of a server. This is possible, for example, in the 
following case. Let us suppose that 1) the server has access to connection 
information including sender and receiver information and 2) the sender and 
receiver information is reference addresses, both of them causing a search 
operation in the server. The sender information could be Alfa@wiral.com and 

10 the receiver information Beta@wiral.com. The server joins these character 
strings resulting in a reference address "AIfa@wiraLcomBeta@wiral.com". 
Then the server generates a fixed-length address by applying an appropriate 
hash function to the said reference address. Hereafter the server creates a 
node including the fixed-length address and adds the node to the search 

15 means. During a connection, the sender sends messages to the receiver, 
wherein the messages contain the sender and receiver information. The 
server joins the sender and receiver information to one reference address, 
generates a fixed-length address for the said reference address, and uses 
the search means to find the fixed-length address. Thus, the server executes 

20 one search operation instead of two search operations. 

The search means can be utilized in various client-server systems, 
such as an access point system show in FIG. 6. 

FIG. 7A shows a client-server system in which a client 701 is a 
browser installed in a terminal 702 and a server 703 is software operating in 

25 a WWW server 704. The WWW server contains thousands of WWW pages 
stored in a database 705. The client 701 and the server 703 communicate via 
an access network 706. The client sends a URL/URI request concerning a 
WWW page to the server and the server sends the WWW page requested to 
the client. 

30 FIG. 7B shows what happens in the server 703 when it receives 

the URL/URI request. The server is equipped with a search means including 
nodes 707 and 708 and, of course, a number of other nodes. Files 709 and 
710 are stored in the database 705. The file 709 includes a reference ad- 
dress 712 from which a fixed-length address 713 is generated. The file 710 

35 includes another reference address 714 from which another fixed-length 
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address 715 is generated. Let us suppose that the URL/URI request of the 
client includes the following reference address: 

http://www.wiral.com/w_2/index.php7pgroupsproducts 
The server 703 must find out whether the database includes a 
5 WWW page related to the reference address and if it does, send the WWW 
page to the client. The server generates a search address by applying a hash 
function to the reference address and uses the search means. In this case, 
the search address is found in the node 708 of the search means. The node 
708 includes a pointer 716 pointing to the file that includes the reference 
1 0 address and the content of the WWW page requested. 
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Claims 

1 . A data structure for data collection searches from a set of data 
collections stored in a memory, a data collection of the set being related to at 
least one reference address, the data structure containing nodes each of 

5 which includes 

a pointing means for accessing a certain data collection from the 

memory, 

characterized in that each of said nodes further includes 
a fixed-length address which is generated by applying a hash 
10 function to a reference address related to the certain data collection. 

2. The data structure as defined in claim ^characterized in 
that each of said nodes further includes at least one 

pointing means adapted to link a node of the data structure to at 
least one other node of the data structure. 
1 5 3. The data structure as defined in claim ^characterized in 

that the data structure is a tree. 

4. The data structure as defined in claim 3, characterized in 
that the data structure is a B-tree. 

5. The data structure as defined in claim ^characterized in 
20 that the data structure is a skiplist. 

6. The data structure as defined in claim 1, characterized in 
that the hash function is essentially based on the MD5 message-Digest algo- 
rithm. 

7. The data structure as defined in claim ^characterized in 
25 that the reference address is a unique piece of information among reference 

addresses related to the set of data collections. 

8. The data structure as defined in claim ^characterized in 
that the reference address contains at least two pieces of information related 
to the certain data collection. 

30 9. The data structure as defined in claim 1,characterized in 

that the reference address is of various length. 

10. The data structure as defined in claim 1, characterized 
in that a data collection search is based on comparing a search address to 
the fixed-length address located in the node, wherein the search address is 

35 generated by applying a hash function to a search key. 
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11. A method for creating a data structure which is intended for 
searches from a set of data collections stored in a memory, a data collection 
of the set containing at least one reference address, each node of the data 
structure including 

5 a pointing means for accessing a data collection from the memory, 

characterized in that each of said nodes further includes a 
fixed-length address, the method containing the steps of: 

receiving a data collection to be added to the set, 
generating a fixed-length address by applying a hash function to a 
10 reference address obtained from the data collection, 

locating the fixed-length address in a node of the data structure, 

storing the data collection in the memory, and 

setting a pointing means of the node to point to the data collection. 

12. The method as defined in claim 11, characterized by the 
1 5 further steps of: 

in a case where the node includes pointing means for linkage, 

using said pointing means in accordance with a cate- 
gory of the data structure. 

13. The method as defined in claim 11, characterized in that 
20 the category is a tree. 

14. The method as defined in claim 13, characterized in that 
the category is a B-tree. 

15. The method as defined in claim 11, characterized in that 
the category is a skiplist. 

25 16. The method as defined in claim 11, characterized in that 

the hash function is essentially based on the MD5 message-Digest algorithm. 

17. The method defined in claim 11, characterized in that 

the reference address is a unique piece of information among reference 

addresses related to the set of data collections. 
30 18. The method as defined in claim 11, characterized in that 

the reference address contains at least two pieces of information related to 

the data collection. 



WO 03/081461 



PCT/FI02/00257 



15 



19. A server for data collection searches applying to a set of data 
collections stored in the server, a data collection of the set being related to at 
least one reference address, 

characterized in that the server includes at least one search 
5 means using records stored in memory each of which includes 

a pointing means for accessing a certain data collection, and 
a fixed-length address which is generated by applying a hash 
function to a reference address related to the certain data collection. 

20. The server as defined in claim 19, characterized in that 
10 each of said records further includes at least one pointing means for linking 

the record to another record. 

21. The server as defined in claim 19, characterized in that 
the records compose a tree data structure. 

22. The server as defined in claim 21,characterized in that 
15 the records compose a B-tree. 

23. The server as defined in claim 19, characterized in that 
the records compose a skiplist. 

24. The server as defined in claim 19, characterized in that 
the hash function is essentially based on the MD5 message-Digest algorithm. 

20 25. The server as defined in claim 19, characterized in that 

the reference address is a unique piece of information among reference 
addresses related to the set of data collections. 

26. The server as defined in claim 19, characterized in that 
the reference address contains at least two pieces of information related to 

25 the data collection. 

27. The server as defined in claim 19, characterized in that 
the reference address is a mobile subscriber integrated service digital net- 
work number (MSISDN). 

28. The server as defined in claim 19, characterized in that 
30 the reference address is a fully qualified domain name (FQDN). 

29. The server as defined in claim 19, characterized in that 
the reference address is a domain name. 

30. The server as defined in claim 19, characterized in that 
the reference address contains a uniform resource locator (URL). 

35 31 . The server as defined in claim 19, characterized in that 

the server is coupled to at least one network. 
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32. The server as defined in claim 31,characterized in that 
the network is an access network. 

33. The server as defined in claim 31, characterized in that 
the network is an intranet. 

5 34. The server as defined in claim 31, characterized in that 

the server is further adapted to: 

receive data from a client, said data containing at least one refer- 
ence address, 

generate a fixed-length address by applying a hash function to the 
10 reference address obtained from the data, and 

perform a predetermined operation using the fixed-length address 

generated. 

35. The server as defined in claim 34, c h a r a c t e r i z e d in that 
to perform the predetermined operation the server is adapted to: 

15 search the fixed-length address from the records stored in the 

memory, and when found, 

transmit the data to another client. 

36. The server as defined in claim 34, c h a r a c t e r i z e d in that 
to perform the predetermined operation the server is adapted to: 

20 search the fixed-length address from the records stored in the 

memory, and when found, 

transmit response data to the client. 

37. The server as defined in claim 34, c h a r a c t e r i z e d in that 
to perform the predetermined operation the server is adapted to: 

25 locate at least the first fixed-length address in a packet containing 

the data and 

send the packet to the intranet. 

38. The server as defined in claim 33, c h a r a c t e r i z e d in that 
the server is adapted to: 

30 receive a packet via the intranet, the packet containing a fixed- 

length address, 

search the fixed-length address from the records stored in the 
memory, and when found, 

perform a predetermined operation. 

35 
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39. The server as defined in claim 38, c h a r a c t e r i z e d in that 
to perform the predetermined operation the server is adapted to: 

search the fixed-length address from the records stored in the 
memory, and when found, 
5 transmit the data to a certain client. 

40. The server as defined in claim 38, c h a r a c t e r i z e d in that 
to perform the predetermined operation the server is adapted to: 

search the fixed-length address from the records stored in the 
memory, and when found, 
1 0 transmit the data to a certain gateway. 

41. The server as defined in claim 35, characterized in that 
the server is a service access point. 

42. The server as defined in claim 37 and 38, characterized 
in that the server is a service access point coupled with the intranet to at 

1 5 least one other server. 

43. The server as defined in claim 40, c h a r a c t e r i z e d in that 
the server is a gateway. 

44. The server as defined in claim 34, c h a r a c t e r i z e d in that 
the data from the client is an instant message. 

20 45. The server as defined in claim 36, characterized in that 

the server is a database server. 

46. The server as defined in claim 34 and 45, characterized 
in that the data from the client is a database request. 

47. The server as defined in claim 36, c h a r a c t e r i z e d in that 
25 the server is a WWW server. 

48. The server as defined in claim 34 and 47, characterized 
in that the data from the client is a uniform resource locator (URL) request. 
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